Estimating Internal Variables and Parameters of a Learning Agent by a Particle Filter

Authors

  • Kazuyuki Samejima
  • Kenji Doya
  • Yasumasa Ueda
  • Minoru Kimura
Abstract

When we model higher-order functions, such as learning and memory, we face the difficulty of comparing neural activities with hidden variables that depend on the history of sensory and motor signals and on the dynamics of the network. Here, we propose a novel method for estimating the hidden variables of a learning agent, such as connection weights, from sequences of observable variables. Bayesian estimation is a method for estimating the posterior probability of hidden variables from an observable data sequence, using a dynamic model of the hidden and observable variables. In this paper, we apply a particle filter to estimate the internal parameters and meta-parameters of a reinforcement learning model. We verified the effectiveness of the method using both artificial data and real animal behavioral data.
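The idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the task, the learning rule, or the filter design, so everything below is an assumption made for illustration: a two-armed bandit, a Q-learning agent with softmax action selection, a learning rate `alpha` and inverse temperature `beta` as the meta-parameters, a random-walk prior on those meta-parameters, and a plain bootstrap particle filter with resampling at every trial. The function names `simulate_agent` and `particle_filter` are hypothetical.

```python
import numpy as np


def simulate_agent(n_trials, alpha, beta, reward_probs, rng):
    """Simulate an assumed Q-learning agent with softmax choice on a two-armed bandit."""
    q = np.zeros(2)  # hidden action values
    actions = np.empty(n_trials, dtype=int)
    rewards = np.empty(n_trials)
    for t in range(n_trials):
        # Softmax over two actions reduces to a logistic function of the value difference.
        p_right = 1.0 / (1.0 + np.exp(-beta * (q[1] - q[0])))
        a = int(rng.random() < p_right)
        r = float(rng.random() < reward_probs[a])
        q[a] += alpha * (r - q[a])  # Q-learning update for the chosen action
        actions[t], rewards[t] = a, r
    return actions, rewards


def particle_filter(actions, rewards, n_particles, rng, drift=0.02):
    """Bootstrap particle filter over the hidden state (Q0, Q1, alpha, log beta).

    Returns the posterior mean of (Q0, Q1, alpha, beta) after each trial.
    """
    q = np.zeros((n_particles, 2))
    alpha = rng.uniform(0.0, 1.0, n_particles)    # assumed prior on learning rate
    log_beta = rng.normal(0.0, 1.0, n_particles)  # assumed prior on log inverse temperature
    means = np.empty((len(actions), 4))
    for t, (a, r) in enumerate(zip(actions, rewards)):
        # Predict: let the meta-parameters drift by a small random walk.
        alpha = np.clip(alpha + drift * rng.normal(size=n_particles), 0.001, 0.999)
        log_beta = log_beta + drift * rng.normal(size=n_particles)
        # Weight: likelihood of the observed action under each particle.
        beta = np.exp(log_beta)
        p_right = 1.0 / (1.0 + np.exp(-beta * (q[:, 1] - q[:, 0])))
        w = np.where(a == 1, p_right, 1.0 - p_right) + 1e-12
        w /= w.sum()
        # Resample particles in proportion to their weights.
        idx = rng.choice(n_particles, size=n_particles, p=w)
        q, alpha, log_beta = q[idx], alpha[idx], log_beta[idx]
        # Update: each particle applies the Q-learning rule with its own alpha.
        q[:, a] += alpha * (r - q[:, a])
        means[t] = [q[:, 0].mean(), q[:, 1].mean(), alpha.mean(), np.exp(log_beta).mean()]
    return means


rng = np.random.default_rng(0)
actions, rewards = simulate_agent(200, alpha=0.2, beta=3.0, reward_probs=(0.3, 0.7), rng=rng)
estimates = particle_filter(actions, rewards, n_particles=500, rng=rng)
print("final posterior means (Q0, Q1, alpha, beta):", estimates[-1])
```

Only the action and reward sequences are passed to the filter, which mirrors the setting described in the abstract: the action values and meta-parameters are never observed directly and are inferred from behavior. How closely the estimates track the true values depends on the number of trials, the number of particles, and the assumed drift.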

Similar resources

Unscented Auxiliary Particle Filter Implementation of the Cardinalized Probability Hypothesis Density Filters

The probability hypothesis density (PHD) filter suffers from lack of precise estimation of the expected number of targets. The Cardinalized PHD (CPHD) recursion, as a generalization of the PHD recursion, remedies this flaw and simultaneously propagates the intensity function and the posterior cardinality distribution. While there are a few new approaches to enhance the Sequential Monte Carlo (S...

A New Modified Particle Filter With Application in Target Tracking

The particle filter (PF) is a novel technique that gives sufficiently good estimation results for nonlinear/non-Gaussian systems. However, the PF is inconsistent, mainly because of the loss of particle diversity in the resampling step and the lack of a priori knowledge of the noise statistics. This paper introduces a new modified particle filter called adaptive unscented particle filter (AUPF) to overcome th...

Development of a particle filter framework for respiratory motion correction in nuclear medicine imaging

This research aims to develop a methodological framework based on a data-driven approach known as particle filters, often found in computer vision methods, to correct the effect of respiratory motion on nuclear medicine imaging data. Particle filters are a popular class of numerical methods for solving optimal estimation problems and we wish to use their flexibility to make an adaptive framewo...

Real Time Calibration of Strap-down Three-Axis-Magnetometer for Attitude Estimation

Three-axis magnetometers (TAMs) are widely utilized as a key component of attitude determination subsystems and as such are considered the cornerstone of navigation for low Earth orbiting (LEO) space systems. Precise geomagnetic-based navigation demands accurate calibration of the magnetometers. In this regard, a complete online calibration process of TAM is developed in the current research t...

Publication date: 2003